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Least-squares algorithms are the fastest converging algorithms for 
adaptive signal processors, such as adaptive equalizers. The Kalman, 
fast Kalman, and adaptive lattice algorithms using a least-squares 
cost function are investigated and extended to complex, fractionally 
spaced equalizers. It is shown that, for a typical telephone channel, 
these algorithms converge roughly three times as fast as the conven- 
tional stochastic-gradient technique. We analyze and compute the 
computational complexities and demonstrate that the fast Kalman 
algorithm is the most efficient in terms of overall performance. 



I. INTRODUCTION 

Adaptive channel equalization is a widespread technique used in 
most high-speed digital data modems. Generally, a transversal filter 
with adjustable coefficients is used as the equalizer. It can be adjusted 
adaptively to compensate for the undesired intersymbol interference 
introduced by the channel. 

A large number of equalizer adjustment algorithms are conceivable, 
depending on the cost function. The currently prevailing technique is 
the so-called stochastic gradient algorithm. In the past years, three 
new rapidly converging algorithms were published, namely, the Kal- 
man, 1 fast Kalman, 2 and adaptive lattice 3 " 7 algorithms. Here, we con- 
sider algorithms which minimize the sum-of-error-squares cost func- 
tion. Because these least-squares algorithms make better use of all the 
past available information than the stochastic gradient algorithms, 
their start-up is faster. 8 

Originally the Kalman, fast Kalman, and adaptive lattice algorithms 
for equalizer update procedures were published for real- valued signals. 
In this paper, we present extensions of these algorithms to complex- 
valued signals which facilitate the analysis of quadrature-amplitude- 
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modulated (qam) data transmission formats. We also extend the 
algorithms to include fractionally spaced equalizers. 9 An important 
characteristic of each algorithm is its computational complexity which 
we analyze for the least-squares, as well as for the stochastic gradient 
algorithms. Simulation results of the equalizer start-up using least- 
squares adjustment algorithms are presented for fractionally and sym- 
bol-spaced equalizers. Quadrature-amplitude-modulated and real-life 
voice-grade transmission channels are used for this study. 

II. THE LEAST-SQUARES ALGORITHMS 

In this section, we describe extensions of the Kalman, fast Kalman, 
and adaptive lattice adjustment algorithms for the coefficient adjust- 
ment of complex, fractionally spaced equalizers. It is assumed that 
equalizer output values are computed once for each symbol interval T, 
where T denotes the time interval between successive data values in 
the transmitter. The fractionally spaced equalizer is assumed to oper- 
ate on T/p-spaced complex samples of the received signal. 

Let £(n) denote the complex p — dimensional vector of the new signal 
samples entering the fractionally spaced equalizer at time nT. Denote 
the M dimensional complex signal vector at time nT containing all 
signal samples over the past N time instances (M = Np) by 

x(n)* - [£(/i)*, i(n - 1)*, • • • £(n - N + l)*].f (1) 

Then the output of the fractionally spaced equalizer is written as 

y(n) = c(n-l)*x(n), (2) 

where c(n — 1) is the M dimensional coefficient vector which was last 
updated at the previous time instant n — 1. The desired data value at 
this instant is d(n). Therefore, an output error 

e(n) = d(n) - y(n) (3) 

results. 

The objective of the least-squares algorithms is to determine the 
coefficient vector c(n) which minimizes the weighted sum of all 
squared errors as if it were used over all the past received signal 
vectors, i.e., c(n) niinimizes 

J X n - k \d(k)-c(n)*x(k)\ 2 . (4) 

Setting the derivative of eq. (4) with respect to c(n) to zero yields the 
discrete-time, Wiener-Hopf equation 



f The * in eq. (1) denotes conjugate complex scalars and conjugate complex trans- 
posed vectors (matrices). 

1906 THE BELL SYSTEM TECHNICAL JOURNAL. OCTOBER 1981 



A(n)c(n) = v(n), (5) 

where 
A(n) = £ \"- k x(n)x(n)* + \"8I M m = XA(n - I) + x(n)x(n)*, (6) 

and 

v(n) = 2 X""*rf(n)*^(w) = Mn - 1) + *(n)d(«)*. (7) 

A small positive definite matrix 8I M m is included to ensure positive 
definiteness of A(n) for all n. For X = 1, 5 = and large n, \/n A(n) is 
an estimate of the channel correlation matrix, 1/n v(n) is an estimate 
of the cross correlation vector between the desired and the received 
signal. For X = 1, all past information is weighted equally in calculating 
an updated coefficient vector; for X < 1 the past is attenuated geomet- 
rically. Consequently, the present has a larger influence on the update 
than the past. This is a desired feature if time- varying channels are 
involved. 

Since eqs. (6) and (7) can be written recursively, the updated 
coefficient vector can be calculated recursively as follows, cf. Appendix 
A: 

c(n) = c(n - 1) + g(n)e(n)* , (8) 

where g(n) is the Kalman gain defined as 

g(n)^A(n)^x{n). (9) 

The Kalman, the fast Kalman, and the adaptive lattice algorithms all 
minimize the same cost function. 4 The difference is in the manner and 
the complexity with which it is achieved. 

The remaining part of this section contains a brief discussion of the 
Kalman and the fast Kalman algorithms. The adaptive lattice algo- 
rithm is discussed in more detail; its derivation is given in Appendix A. 
Emphasis is placed on the signal transformation which is performed 
by the lattice structure. This signal transformation permits the eval- 
uation of equalizers of increasing order in a computationally efficient 
way. The three algorithms are given in Appendix B in a form suitable 
for numerical evaluation. 

2. 1 The Kalman algorithm 

The Kalman algorithm makes use of the recursive definition of A(n) 
in eq. (6) and iteratively computes and stores its inverse A(n) _I . The 
equalizer coefficient vector is then updated according to eqs. (8) and 
(9) at each iteration. 
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While the Kalman algorithm assures rapid equalizer start-up, it has 
the disadvantage of requiring matrix operations. Therefore, the num- 
ber of calculations is proportional to M 2 and grows very fast with 
increasing M. 

2.2 The fast Kalman algorithm 

Ljung et al. 10 succeeded in formulating an equivalent algorithm with 
reduced complexity, where the number of operations is proportional 
to M . This algorithm was applied to the adaptive equalizer in Ref. 2. 
The algorithm exploits the fact that only p new signal samples enter 
the signal vector x(n),p samples are discarded, and the remaining are 
just shifted. This is accomplished by means of p X M dimensional 
forward and backward predictors for the new and discarded values. 
Recurrence equations for these predictors and, finally, for the Kalman 
gain vector can be derived based on this. At most, p X M matrices 
have to be iterated — the chief reason for the reduced complexity. 

2.3 The least-squares adaptive lattice algorithm 

Recently the adaptive lattice algorithm for a least-squares cost 
function, originally published by Morf et al., 11 was extended to equal- 
izer update applications by Satorius and Pack 3 and Shichor. 4 Its 
application to the decision feedback equalizer is reported by Shensa. 6 
Here, a further extension to the complex fractionally spaced equalizer 
is presented. A short form of this was published by Lim and Mueller. 7 

In the adaptive lattice structure, the equalizer coefficients operate 
on a transformed signal vector 

x(n) = L(n-l)x(n), (10) 

where the transformation matrix is a lower triangular matrix formed 
by the backward prediction coefficients Cm(n) of order m, m = 1 • • • 
N - 1, i.e., 

I ••• 01 

-c^n)* 70... 



L(n) = 



-cUn)* JO-. 

70 
L-cW/i)* / J 



(11) 



The backward predictor Cmin) of order m is a mp X p dimensional 
matrix satisfying 
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In eq. 12, e b (m, n) is thep X p dimensional backward prediction error 
residual of order m. For a detailed discussion of the backward predictor 
and the backward prediction error residual refer to Appendix A. In 
eqs. (11) and (12), J denotes apXp dimensional identity matrix. The 
crosses in eq. (12) denote some unspecified elements which are of 
no further interest at this time. It follows from eqs. (11) and (12) 
that A(n)L(n)* is a lower block triangular matrix. Then it follows 
that L(n)A(n)L(n)* is lower block triangular, because it is the 
product of two lower block triangular matrices. On the other hand, 
L(n)A(n)L(n)* is Hermitian because A(n) is Hermitian according to 
its definition in eq. (6). Therefore, L{n)A(n)L(n)* has to be a block 
diagonal matrix. Its diagonal element at the mth position is e fi (m - 1, 
n), i.e., 



L(n)A(n)L(n)* = diag [e b {m - 1, n)]. 



(13) 



Hence, L(n) diagonalizes A(n). 

We assume that all e b (m - l,n) are invertible and so we invert eq. 
(13). After premultiplying the result by L(n)* and postmultiplying it 
with L(n) we obtain 



A(n)- 1 = L(n)*diag [e b (m - 1, n)- x ]L(n). 



(14) 



Since it is desired that the adaptive lattice equalizer perform identically 
as the other least-squares equalizers, it follows that all the equalizer's 
output signals have to be equal, i.e., 

y(n) = c(n - l)*x(n) = c(n - l)*x(n). (15) 

From eqs. (10) and (15), it follows that the transformed coefficient 
vector c(n) has to satisfy 

c(n)=L(n)-*c(n). (16) 

The matrix L(n)~*, denotes the conjugate transposed inverse of L(n). 
Upon substituting eqs. (5) and (14) into eq. (16), we obtain 

c(n) = diag [e b (m - 1, w) _1 ]L(n)i;(n). (17) 

This suggests that the transformed coefficient vector is easily obtain- 
able from the transformed correlation vector. The equalizer output of 
order N can be written as 
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y(n) = £ z{m, n - l)*e*(m - 1, n - l)~ l e b (m - 1, n), (18) 

m=\ 



where we defined 



x(n) = 



e b (0,n) ~ 
e b (l, n) 



.e b (N-l,n)- 



, and L(n)v(n) = 



z(2, n) 



.z(N,n). 



(19) 



Note that the elements of x(n) are the backward prediction errors of 
order to N — 1. Therefore, the transformed equalizer operates on the 
backward prediction errors of order to N — 1. 

Equations (10), (11), and (17) to (19) define the adaptive lattice 
equalizer as a transform of the ordinary transversal equalizer. An 
interesting property of this transform is due to the fact that L(n) is a 
lower triangular matrix. This makes it possible to increase the dimen- 
sion of the transformed signal vector and of the equalizer in a rather 
simple way, i.e., only one new p- vector is added to the vectors of order 
m — 1 to form the vectors of order m. The already existing elements 
are unchanged. Accordingly, the equalizer output can be computed 
order-recursively in a very efficient way. 

The time update algorithm of the lattice structure makes consequent 
use of the above-described order recursions. In addition to the back- 
ward predictor, the forward predictor and its error residual are iterated. 
Only the prediction errors and the prediction error residuals of order 
zero are updated in time. Then, using these elements as an anchor, the 
prediction errors and prediction error residuals of higher order are 
obtained recursively. It turns out that the predictions themselves are 
not needed. Finally, a time update of the elements z(ra, n) of the 
transformed vector v(n) is obtained. 

This scheme also makes use of all previously received data. Theo- 
retically, its performance should be identical to the Kalman and the 
fast Kalman algorithms. Since there are no matrices involved, storage 
requirements and numbers of multiplications increase linearly with 
the equalizer length, though faster than in the fast Kalman algorithm. 

A detailed derivation of the least-squares adaptive lattice equalizer 
algorithm is given in Appendix A. There, we adopt a notation which 
allowed us to describe the equalizer, the backward and the forward 
predictors as special cases of a general least-squares problem. In 
Appendix B, we list the adaptive lattice algorithm together with the 
two other least-squares algorithms in a form suitable for sequential 
execution. 
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III. COMPLEXITY 

The number of multiplications (divisions are counted as multipli- 
cations) per iteration and the required precision are the dominant 
factors determining the complexity of real-time algorithms. 

The effect of limited precision was investigated by T. L. Lim and 
the author in earlier work on that subject. There are no major 
differences between the three algorithms. With floating-point arith- 
metic, the requirements for the mantissa are from 11 to 12 bits for 
symbol-spaced equalizers and from 13 to 15 bits for T/2-spaced equal- 
izers. 

Table I gives the number of multiplications for the three least- 
squares algorithms and for the stochastic-gradient algorithm which is 
obtained when in eq. (8); g(n), is replaced by a scalar. Results for both 
symbol- and T/2-spaced equalizers are given. The numbers for expo- 
nential weighting are included for the three least-squares algorithms. 

The gradient algorithm requires the smallest number of multiplica- 
tions, followed by the fast Kalman, the adaptive-lattice, and the 
Kalman algorithms. The gradient algorithm, of course, requires twice 
as many multiplications for the T/2 equalizer than for the symbol- 
spaced equalizer. For the Kalman algorithms, this factor is about four 
and for the adaptive-lattice, it is about five. 

The fast Kalman algorithm has the lowest complexity of all least- 
squares algorithms; it requires about five times as many multiplications 
as the gradient algorithms for symbol-spaced equalizers and ten times 
as many for T/2-spaced equalizers. The adaptive-lattice algorithm 
requires more multiplications than the fast Kalman algorithm, espe- 
cially for T/2-spaced equalizers. This is mainly because of the large 
number of matrix operations which is reflected in the large coefficient 
of p 3 . However, it was pointed out in Refs. 3 and 4 that it offers a 

Table I — Number of complex multiplications for equalizer spanning N 
symbol intervals with p samples per interval 









N=31 


N = 31 






# Multiplications 


p = \ 


p = 2 


Gradient 
Kalman 


A/l 
A = 1 


2Np 

2 N 2 p 2 + 5Np 

Np{Np + l)/2 less than for A ¥= 1 

N(p 3 + 6p)+-p :, + 2p 2 +-p 


63 
2015 
1519 


127 
7998 
6045 


Fast Kalman 


A#l 


316 


1202 




A = l 


p(p + l)/2 less than for A ^ 1 


315 


1199 


Adaptive lattice 


A*l 


7v(H p 3 + 7p 2 + ^ p ^_ 4/ ,3_ 5p 2_ 


•lp 454 


2046 




A = 1 


( N --\p(p + 1) less than for A ^ 1 


393 


1863 
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unique feature: the number of equalizer taps can be increased according 
to the actual need for the particular channel involved. Since for real- 
time applications the computational power for the longest required 
equalizer has to be provided, this cannot be regarded as an advantage 
and does not justify the considerably higher complexity for modem 
applications. 

The Kalman algorithm requires the largest number of multiplica- 
tions. Since it offers no additional features when compared with the 
fast Kalman algorithm, the latter is preferred for equalizer implemen- 
tations. 

IV. SIMULATED COMMUNICATION SYSTEM 

Figure 1 shows the simulated system, where the transmitter is 
assumed to have a raised-cosine shaped transfer function with 12 
percent excess bandwidth. Quadrature amplitude modulation with a 
symbol rate of 2400 baud and 2 bits per symbol is used. The carrier 
frequency is placed at 1700 Hz. The data symbols in the in-phase 
branch are taken from a binary pseudo random noise sequence (prns). 
The same sequence, shifted and reversed in time, is used in the 
quadrature branch. 

Various channel transfer functions were considered. Figure 2 shows 
the transfer function of a channel which barely meets the requirements 
for basic conditioning of private lines. The eigenvalue spread of the 
autocorrelation matrix for symbol-spaced samples equals 9.8. The 
equivalent baseband impulse response of the combined transmitter 
and channel is used to generate the input data for the equalizer. 
Gaussian noise of specified power is added. 



MODULATOR 



DEMODULATOR 



ST 



TRANSMITTER 




e{n) 



+dW 



Fig. 1 — Simulated transmission system. 
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Fig. 2 — Channel transfer function. 



V. INITIAL CONVERGENCE 

The initial convergence of the squared error at the output of the 
equalizer was determined for the Kalman, fast Kalman, and adaptive 
lattice equalizer structures. The behavior of the gradient algorithm, as 
well as of a fixed transversal filter with optimal coefficients were 
simulated for comparison purposes. Single precision floating-point 
arithmetic is used throughout, i.e., the mantissa is represented by 24 
bits. The s/n is 25 dB and all equalizer coefficients are initially set to 
zero. A prns with a period of 127 symbols is used for the data symbols, 
and ten simulation runs with different starting points with respect to 
the prns are averaged. The resulting curve is smoothed with an 
exponential weighting factor of 0.9 to obtain the results shown in Fig. 
3. 

Figure 3 shows the results for the channel depicted in Fig. 2. The 
sampling phase is chosen to be 25 percent of a symbol interval away 
from the optimal sampling phase. Figure 3a corresponds to a 31-tap, 
symbol-spaced equalizer and Figure 3b to a T/2-spaced equalizer also 
spanning 31-symbol intervals. The behavior of the Kalman and fast 
Kalman algorithms has been observed to be identical [the difference 
in the output mean squared error (mse) is always smaller than 0.01 
dB]. Therefore, the Kalman algorithm is not included on the plots. 

The optimal fixed equalizer attains an output mse of 23.1. dB 
normalized to the signal level. For the symbol-spaced equalizer, about 
125 iterations are required to converge to a normalized mse of 20 dB. 
For the T/2-spaced equalizer, all least-squares algorithms converge in 
about 150 iterations. The gradient algorithm requires about 400 itera- 
tions to converge to the same level. Very similar results were obtained 
for a channel with amplitude distortion as shown in Fig. 2 but with no 
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Fig. 3 — (a) Convergence of symbol-spaced equalizer. N = 31, s/n = 25 dB. (b) 
Convergence of T/2-spaced equalizer. N = 31, p = 2, s/n = 25 dB. 



phase distortion. If an ideal channel (no amplitude and no phase 
distortion) is used, the convergence time is reduced by about 35 
percent. 

These results indicate that, for realistic telephone channels, the 
least-squares algorithms behave very similarly and converge about 
three times faster than the stochastic gradient algorithm. Furthermore, 
it is found that the least-squares algorithms can be implemented 
successfully for both the symbol-spaced and T/2-spaced complex 
equalizers. Notice that the adaptive lattice algorithm requires a high 
number of matrix inversions per iteration if p > 1, which is often 
susceptible to numerical instabilities. However, our simulations did 
not uncover any stability problems. 

The inclusion of an exponential weighting factor in the sum-of- 
squares cost function was proposed in Refs. 2 and 3 to allow for the 
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tracking of time varying parameters or channels. When this was 
included in our simulations, and single precision floating-point arith- 
metic was used, an unstable behavior of the fast Kalman algorithm 
resulted. Double precision arithmetic (i.e., 56-bits for the mantissa) 
was found to eliminate the instability. The Kalman and the adaptive- 
lattice algorithms did not show this instability. 

VI. CONCLUSION 

The Kalman, fast Kalman, and adaptive-lattice algorithms are the 
fastest known methods for the training of equalizers. In particular, it 
was found that they require only about a third as many iterations as 
the gradient algorithm to converge to within 3 dB of the optimal mse. 
For a T/2-spaced equalizer and a worst-case channel, the equalizer 
start-up requires about 150 iterations. 

The fast Kalman algorithm possesses the lowest complexity of these 
schemes. It requires about ten times as many multiplications per 
iteration for a T/2-spaced equalizer as the stochastic-gradient algo- 
rithm. 

The adaptive-lattice algorithm requires more multiplications per 
update but has the advantage of being able to increase the equalizer 
length adaptively when needed. This is an advantage for off-line or 
batch processing but not for real-time applications. 

The Kalman algorithm possesses the highest complexity and offers 
no advantage over the two other schemes. Therefore, it is not recom- 
mended for implementation. 
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APPENDIX A 

Derivation of the Least-Squares Algorithms 

Let £(/i) De a p-dimensional complex vector denoting the new 
elements in the pm dimensional signal vector x m (n) 

x m (n) = [£(n)*, ■ • . £(n - m + 1)*]*. (20) 

Let c s m (n) be a complex pm dimensional coefficient vector, which 
denotes the equalizer coefficients if s = e, and let Cm(n) be a complex 
pm X m dimensional matrix which stands for the forward predictor if 
s = /and backward predictor if s = 6. 

Then the output of the equalizer, the forward and the backward 
predictors can generally be expressed as 

y s (m, n) = c s m (n - l)*x m (n). (21) 
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The double argument (m, n) denotes order m and time n. The desired 
signal d s (m, n) is defined as 

(a(n — D) for s = e 
£(n + 1) for s = f . (22) 

£(/i — m) for s = b 

The transmission delay between transmitter and receiver is denoted 
by D. The equalization and prediction errors e 8 (m, n) are defined as 

e s (m, n) = d s {m, n) - y s (m, n) . (23) 

For s = e, y s (m, n), d s {m, n) and e s (m, n) are complex scalars, for s ¥* 
e they are p— dimensional, complex vectors. 

The equalizer and prediction coefficients are determined such that 
they minimize the trace of the following least-squares cost function 

l X n - k [d s (m, k) - c s m (n)*x m (k)][d s (m, k) - c 8 m (n)* x m (k)]* . (24) 

Lambda is a geometric weighting factor. Differentiating the above cost 
function with respect to c 8 m (n) and equating the resulting expression to 
zero yields the discrete-time, Wiener-Hopf equation for the coefficients 

A m (n)c s m (n) = v s m {n), (25) 

where 

A m (n) = £ \ n - k x m {k)x m (k)* = XA m (n - 1) + x m (n)x m (n)* (26) 
vin(n) = i X n - k x m (k)d s (m, k)* = Xv s m (n - 1) 

k=0 

+ x m (n)d s (m,n)*. (27) 

In eq. 27, A m (n) is an mp X mp dimensional, Hermitian matrix, and 
v 8 m (n) is a mp X p dimensional complex matrix. They are equivalent to 
the autocorrelation matrix and the cross-correlation vectors which 
occur in the familiar mean-square approach. 

The optimal value of the cost function is obtained when the solution 
resulting from eq. (25) is substituted into eq. (24) 

€ s (m, n) = E*(m, n) - v' m (n)* c' m (n) , (28) 

where 

E s (m, n) = £ X n - k d s (m, k)d s (m, k)* 

= XE s (m, n-l) + d 8 (m, n)d 8 (m, n)* . (29) 
For s = e, e s (m, n) is a scalar. For s =£ e, it is ap X p Hermitian matrix. 
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A. 1 Time update recursions 

We observe that eqs. (26) and (27) contain a recursive definition of 
A m (n) and v" m (n). This allows us to obtain a recursion in time for the 
optimal coefficients. Upon combining eqs. (25) and (27) we have 

A m (n)c B m (n) = XA m (n - l)c 8 m (n - 1) + x m (n)d"(m, n)*. (30) 

Add and subtract x m (n)x m (n)*c s m (n - 1) to the right-hand side of eq. 
(28), then use eq. (23), the recursive form of eq. (26), and premultiply 
both sides with A m (n)~ } to obtain the desired recursion 

c s m (n) = c s m (n - 1) + A m (nr l x m (n)e 8 (m, n)* . (31) 

To obtain time update recursions for various auxiliary variables, we 
consider c s m (n)*u' m (n), where s and t e {e, b, f] . From the time update 
recursion for the coefficients eq. (31), we conclude that 

c s m (n)*v' m (n) = [c a m (n - 1)* + e s (m, n)x m (n)*A m (n)- l ]v' m (n). (32) 

We multiply out and use eq. (27) for the first term. In the second term, 
we apply eq. (25) and obtain 

c 8 m (n)*v' m (n) = \c B m (n - l)*v' m (n - 1) 

+ c s m (n - l)*x m (n)d'(m, n)* + e a (m, n)Xm(n)*cUn) . (33) 

Now add and subtract d"(m, n)d'{m, n)* and use eqs. (21) and (24) to 
obtain 

c 8 m (n)*v' m {n) = Xc a m (n - D*v' m (n - 1) 

+ d s (m, n)d'(m, n)e s (m, n)[x m (n)*c' m (n) - rf'(m, »)]. (34) 

From eq. (31) it follows that the error after updating the coefficients 

e'(m, n)* = -x m (n)*c' m (n)+d'(m, n) = (l-y(m, n))e l (m, n)*, (35) 

where we defined the real scalar 

y(m, n) = x m (n)*A m (n)~ 1 x m (n) . (36) 

Finally, we obtain from eqs. (34) and (35) 

c s m (n)*v' m (n) = Xc a m (n - l)*v' m (n - I) + d'(m, n)d'(m, n) 

- e'(m, n)e'(m, n)*[l - y(m, »)]. (37) 

A time recursion for e'(m, n) can be obtained from eq. (28) by using 
eqs. (28), (29), and (37) 

e 8 (m, n) = \€ 8 (m, n - 1) + [1 - y(m, n)]e s (m, n)e s (m, n)* . (38) 

The mth component of the transformed correlation vector is defined 
as, cf. eqs. (11) and (19) 
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z(m, n) = [-c b m -i(n)*, r\v e m (n) 

= -cJL.iO»)*i&-i(n) + I \ n -"Z(k + 1 - m)d e (m, k)\ 

We note that d e (m, k) = d e (m - 1, k), and within apply eq. (37) and 
obtain the time recursion 

z(m, n) — Xz(m, n — 1) 

+ [1 - y(m - 1, n)]e b (m - 1, n)e e (m - 1, n)* . (39) 

For future use, we define the p X p matrix 

k(m - 1, n) m £ A n+I -*|(* - m)£(k)* - ^-i(»)*ci»-i(n). 

We apply eq. (37) and observe that d f (m — 1, n) = £(n + 1) and that 
rf*(m - 1, n) = £(n — m + 1). Thus, we obtain 

k(m — 1, n) = Xk(m — 1, n — 1) 

+ [1 - y(m - 1, n)]e 6 (m - 1, n)e f (m - 1, n)* . (40) 

4.2 Order update recursions 
Observe that from eq. (20) it follows 



x m+i (k) = 



Xm(k-l) 



X m (k) 



Z(k - m) 



(41a, b) 



This relation is the order-time update equation for the signal vector. 
It allows the derivation of order-time update equations for the various 
coefficient vectors, for the error values and for the error residuals. 
Update equations for various auxiliary variables necessary for the 
algorithm are also derived. 

From eq. (41) and the definitions eqs. (26) and (27) and under the 
condition that x m (0) = it follows 



A m+ An + 1) = 



E f (m, n) 
y m (n) 


v f m {n)* 
A m (n) 


\A m (n + 1) 
v b m (n + l)* 


v b m (n + l) 
E b (m, n + 1) 



(42a, b) 



Upon combining eq. (26) and eq. (42a, b) we obtain the augmented 
Wiener-Hopf equations 
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A m+ i(n + 1) 

/ 



-of 



cUn) 



-c b m (n + 1) 


= 


re f (m, n) 





/ 



e b (m,n+ 1). 



(43a, b) 



where we used partitioned matrices to represent the two systems of 
equations for the forward and the backward predictors, having the 
same matrix of coefficients. 

Similar equations can be derived for predictors of reduced order 



Am+i{n + 1) 



I 





-c f m -i(n) 


-c b m -i(n) 





I 



-\ -| e f {m — 1, n) 





k f (m — 1, n) 



k b (m- l,n)-\ 





€ b (m-l, n)\ (44a, b) 



Equation (44a) can easily be verified by applying the expansion of 
(42b) with reduced order and time indices to A m (n) in eq. (42a). The 
same method with the order reversed verifies eq. (44b). 

The auxiliary variables k f (m — 1, n) and k b (m — 1, n) are defined as 



k f (m - 1, n) = £ A" +1 -*£(A - m)£(k)* - v^i(n)* c f m -i(n) (45a) 
k b (m - 1, n) - J \ a+l - k S(k)£{k - m)* - v f m -M)* c b m -M) . (45b) 



From eq. (45a, b) and with the definition of the predictor coefficients 
from eq. (25), it is easily verified that 

k f (m - 1, n) = k b (m - 1, n)* = k(m - 1, n). (46) 

The order update equations for the predictor coefficients and for the 
error residuals are now obtained through the combination of eqs. (43) 
and (44). We consider that 

(43a) = (44a) - (44b)e 6 (m - 1, n)' x k(m - 1, n) 



and 



(43b) = (44b) - (44a) e f (m - 1, n)~ l k(m - 1, n)* . 



This, with n reduced by one, yields an update equation for the 
prediction error residuals 



ADAPTIVE EQUALIZERS 1919 



e f (m, n — 1) = e f (m — 1, n — 1) 

- A-(m - 1, n - l)*e"(m - 1, n - ir l k(m - 1, n - 1) (47a) 
e ft (m, n) = e 6 (m - 1, n - 1) 

- £(m -l,n- \)e f (m - 1, n - \)~ x k{m - 1, n - 1)*. (47b) 

We assume now that A m +i(n + 1) is nonsingular and premultiply eqs. 
(43) and (44) by its inverse. The same combinations of the new 
equations yield update equations for the coefficients. 



c f m (n) = 



c b m (n + 1) = 



c f m An) 



- 


'C b m -M)~ 
-I 



■ € b (m- l,n)' l k(m- 1, n) 



(48a) 






- 


-I 1 


_c b m -M) 


c f m -i(n) 



■ € f (m- l,n)~ l k(m- 1, n)* 



(48b) 



We premultiply eq. (48a) with x m (n + 1) and eq. (48b) with 
x m (n + 2)*. This yields eqs. (41), (21), and (23) 

x m (n + \)*c f m (n) - x m -i(n + l)*c f m -i(n) - [x m -i(n + 1)* 

•c b m -i(n) - £(/i - m + 2)*]e b (m - 1, ny l k{m - 1, n) (49a) 
x m (n + 2)*c b m (n + 1) = x m -i(n + l)*c b m -i(n) - [x m -i(n + 1)* 

• c f m -An) - £(n + 2)*]e f (m - 1, ny x k(m - 1, n)* . (49b) 

With eq. (23) we identify the terms in the bracket of eq. (49a) as e b 
(m - 1, n + 1)* and in the bracket of eq. (49b) as e f {m - 1, n + 1)*. 
We transpose eq. (49a), decrease n by 2, and use eq. (23) to obtain the 
update equations for e f (m, n — 1). The update equation for e b (m, n) 
is obtained similarly. 

e f {m, n — 1) = e f (m — 1, n — 1) 

-k(m-l,n- 2)*e b (m - 1, n - 2) _1 e fe (m - 1, n - 1) (50a) 
e b (m,n) - e b (m - 1, n - 1) 

-k(m,n- 2)e f (m, n - 2) _1 e / (m, n - 1). (50b) 

The Kalman gain g m+ i(n) is defined by 

A»+i(n)£m+i(n) = x m +i(n). (51) 

From eq. (40) we deduce 
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A m+X (n) 




g m (n - 1) 



gmin) 


v f m (n-l)*g m (n-l) 



x m (n) 



v b m (n)*g m (n) 



(52a, b) 



x m (n - 1) 

We note from eq. (41) that eqs. (51), (52), and (43) are related as 
follows 
(51) = (52a) + (43a)c /r (m, n - l) _1 [|(/i) - v f m {n - l)*g m (n - 1)] 

(51) = (52b) + (43b)c 6 (m, n) -1 [£(/i - m) - v b m (n - l)*g m (n - 1)]. 

We identify the terms in the brackets as the forward and backward 
prediction errors after updating the coefficients eq. (35). Performing 
the above-defined linear combinations of eqs. (43), (51), and (52) and 
premultiplying with Am+i(*t)~\ yields order update equations for the 
Kalman gain. 



gm+i(n) = 



g m +\(n) = 







g m (n-l) -c f m (n- 1) 

•e f (m, n — l)~ x e f {m, n — 1) 



~gm{n)~\ 



+ 


~-c b m (n)~ 


I 



(53a) 
€ b (m,n)- l e b (m,n). (53b) 



We now proceed to obtain order update equations for y(m, n) as 
defined by eq. (36). Note from eqs. (36) and (51) that 

y(m, n) = x m (n)*g m (n). (54) 

Upon using eqs. (41a), (53a), (41b), and (53b) respectively, we obtain 

y(m, n) = y{m - 1, n - 1) + e f (m - 1, n - 1)* 



• € f (m - 1, n - \y l e f {m - 1, n - 1) 
y(m, n) = y(m - 1, n) + e b (m - 1, n)* 

-e b (m- l,n)~ l e b (m-l,n). 



(55a) 
(55b) 



APPENDIX B 

Least-Squares Equalizer Update Algorithms 

Here the least-squares equalizer update algorithms are listed and 
ordered such that they can be evaluated in the given sequence. 
Emphasis is put on a simplified notation compared to Appendix A. 
Generally, capitals denote matrices and lower case letters denote 
scalars and vectors. Table II gives the correspondence of variables and 
their dimensions, where M = Np. 
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Table II — Correspondence of variables 



Variable 


Appendix A 


Appendix B 


Dimension 


Signal vector 


x m (n) 


x(n) 


M 


Correlation matrix 


A m (n) 


A(n) 


MxM 


Equalizer coefficients 


c e m (n) 


c(n) 


M 


Equalizer error 


e e (m, n) 


e(m, n) 


1 


Forward prediction 








Coefficients 


cUn-1) 
e'im, n — 1) 


F(n) 


Mxp 


Error 


f(m, n) 


P 


Error residual 


e f (m, n - 1) 


E f (m, n) 


pxp 


Backware prediction 








Coefficients 


c b m (n) 


B(n) 


Mxp 


Error 


e"(m,n) 


b(m, n) 
E h (m, n) 


P 


Error residual 


e b (m,n) 


pxp 


PARCOR coefficient 


k(m-l,n-l) 


K(m, n) 


pxp 


Kalman gain 


gm(n) 


g(n) 


M 



B. 1 The Kalman algorithm 

The Kalman algorithm makes use of the recursive definition of A (n) 
in eq. (26) and the matrix inversion lemma, i.e., 



A(»r-i 



. A(n-ir l x(n)x(n)*A(n-lY 
A(n — 1) — 



(56) 



X + x(n)*A(n- l)~ l x(n) 

and defines 

P(n)=A(n)- 1 . (57) 

Upon using eqs. (2), (3), (8), (56), and (57) we obtain the Kalman 
algorithm for equalizer updating 

t(n)=P(n- l)x(n), (58) 

g{n) = t(n)/(\ + x(n)*t(n)), (59) 

P(n) = {Pin - 1) - g(n)t(n)*] - , (60) 

y(n)=c(n-l)*x(n), (61) 

e(n) = d(n)-y(n), (62) 

c(n) = c(n - 1) + g(n)e(n)*. (63) 

To initialize, set all variables to zero except P(0) which is set to 
p(0) = 1/5 J. Note that because of the Hermitian nature of A (n) and 
consequently of P(n) = A(n)~\ eq. (16) needs only be evaluated for 
the upper (or lower) triangle including the diagonal. 

B.2 The fast Kalman algorithm 

To obtain the fast Kalman algorithm, apply eqs. (21) to (23), (31), 
(35), and (38) for the forward predictor of fixed order m. This yields 
eqs. (64) to (67) 
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f(n) = £(/i) - F(n - l)*x(n - 1) (64) 

F(n) = F(n - 1) + g(n - l)f(n)* (65) 

f(n)' = f(n)[l -g(n-l)*x(n- 1)] (66) 

E(n) =\E(n-l)+ f(n)'f(n)*. (67) 

Then use eq. (53a) to calculate the extended Kalman gain g(n) and 
partition as indicated 



g(n) = 



E(n)- l f(n)' 



g(n)' 
fi(n) 



(68) (69) 



g(n - 1) - F{n)E(n)-'f(n)' J 

Now the backward prediction error b(n) is calculated from eq. (23) 

b(n) = £(n-N)-B{n-l)*x(n). (70) 

Equations (31) and (53b) can now be used to update the backward 
predictor and finally to determine the updated Kalman gain 

B{n) = [B(n - I) + g(n)'b(n)*][I P p- Hin)b(n)*]- 1 (71) 

g(n)=g(n)' + B(n)ii(n). (72) 

Equations (21) to (23) and eq. (31) applied to the equalizer conclude 
the algorithm. 



To initialize, set 



and 



y(n) = c(n - l)*x(n) 

e(n) = d{n) - y{n) 

c(n) = c(n- 1) +g(n)e(n)*. 

F(0) = B(0)=0m p 
x(0)=g(0) = c(0)=0 M 



(73) 
(74) 
(75) 



E(0) = 8I PP . 

Notice for the numerical evaluation that E is Hermitian and that the 
matrix inversions in eqs. (68) and (71) can be avoided if a 
p— dimensional system of linear equations is solved for multiple right- 
hand sides. 

B.3 The lattice algorithm 

For each time instant, the algorithm is initialized for order zero 

y(0, n) = y(0, n) = (76) 

c(0,n) = d(n) (77) 
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f(0,n) = b(0,n)=£(n) (78) 

E f (0, n) = E b (0, n) = \E f (0, n - 1) + £(/i) £(«)*. (79) 

From eq. (40), the following time update equation follows 

K(m, n) = \K{m, n - 1) + t(m, n - 1) f{m - 1, n)* . (80) 

Then we obtain from eq. (50) order update equations for the prediction 
errors 

f(m, n) = f(m - 1, n) - G(m, n - l)b(m - 1, n - 1) (81) 

b(m, n) = b(m - 1, n - 1) - H(m, n - l)f{m - 1, n), (82) 

where auxiliary p X p matrices are determined as 

G(m, n) = K(m, n)*E b (m- 1, n - 1) _1 (83) 

H(m, n) =K(m, n)E f (m- 1, n)" 1 . (84) 

Equation (48), together with eqs. (83) and (84), permits the update of 
the prediction error residuals 

E f (m, n) = E f (m - 1, n) - G(m, n)K(m, n) (85) 

E b (m, n) = E"(m - 1, n - 1) - H(m, n)K(m, n)* . (86) 

The equalizer output and output error follow 

y(m, n) = y(m — 1, n) + z(m, n — 1)* 

■ E h (m- 1, n- l) _1 fe(m- 1, n) (87) 

e(m, n) = d(n) — y(m, n). (88) 

From eqs. (35) and (55), we have 

t(m, n) = [1 - y(m - 1, n)]b(m - 1, n) (89) 

y(m, n) = y(m - 1, n) + t{m, n)*E b (m - 1, ny x t{m, n). (90) 
Equation (39) finally allows to update the coefficients 

z{m, n) = Xz(m, n — 1) + t(m, n)e(m — 1, n). (91) 

Equations (87), (88), and (91) are evaluated for me[l, N]. The other 
equations are evaluated for me[l, N — 1]. To initialize, set all variables 
to zero except £'(0, 0) = E b (0, 0) = 8I PP . 

For the numerical evaluation, it should be noted that E f (m, n) and 
E b (m, n) are Hermitian, thus, only the real diagonal and the upper or 
the lower triangle need be computed. Note also, that G(m, n) and 
H(m, n) are computed best as the solution of a p— dimensional system 
of linear equations with p right-hand sides. 
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